On the Intrinsic Locality Properties of Web Reference Streams
Abstract
Considerable work has been done on Web reference streams: sequences of requests for Web objects. In particular, many studies have looked at the locality properties of such streams, because of the impact of locality on the design and performance of caching and prefetching systems. However, a general framework for understanding why reference streams exhibit given locality properties has not yet emerged. In this paper we take a first step in this direction. We propose a framework for describing how reference streams are transformed as they pass through the Internet, based on three operations: aggregation, disaggregation, and filtering. We also propose metrics to capture the temporal locality of reference streams in this framework. We argue that these metrics (marginal entropy and interreference coefficient of variation) are more natural and more useful than previously proposed metrics for temporal locality, and we show that they provide insight into the nature of reference stream transformations in the Web.
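The abstract names the two metrics but does not define them. The sketch below is only an illustration under assumed, textbook-style definitions, not the paper's exact formulation: marginal entropy is taken to be the Shannon entropy of the empirical object popularity distribution, and the interreference coefficient of variation is taken to be, per object, the standard deviation of the gaps between successive references divided by their mean. The function names `marginal_entropy` and `interreference_cv` are hypothetical.

```python
import math
from collections import Counter, defaultdict
from statistics import mean, pstdev


def marginal_entropy(stream):
    """Shannon entropy, in bits, of the empirical popularity distribution.

    Assumed definition: H = -sum(p_i * log2(p_i)), where p_i is the
    fraction of requests in the stream that refer to object i.
    """
    counts = Counter(stream)
    total = len(stream)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def interreference_cv(stream):
    """Per-object coefficient of variation of interreference gaps.

    Assumed definition: for each object, take the distances (in number
    of requests) between successive references and return std / mean.
    Objects with fewer than two gaps are skipped.
    """
    positions = defaultdict(list)
    for index, obj in enumerate(stream):
        positions[obj].append(index)

    result = {}
    for obj, pos in positions.items():
        gaps = [later - earlier for earlier, later in zip(pos, pos[1:])]
        if len(gaps) >= 2:
            result[obj] = pstdev(gaps) / mean(gaps)
    return result


if __name__ == "__main__":
    # Hypothetical toy stream: "a" is referenced in a burst, then once later.
    requests = ["a", "a", "a", "x", "x", "x", "x", "a"]
    print(marginal_entropy(requests))
    print(interreference_cv(requests))
```

Under these assumptions, a lower entropy indicates that requests concentrate on fewer objects, and a per-object coefficient of variation of zero indicates perfectly regular re-references, while larger values indicate burstier ones.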
Similar resources
Locality Characteristics of Web Streams Revisited
This paper studies locality of reference properties of Web streams using a recently proposed Aggregation-Disaggregation-Filtering framework. Two primary research questions are addressed: 1) What impact does locality of reference have on caching performance? and 2) What are the locality characteristics of streams that result from aggregation of filtered streams? Trace-driven simulations are used ...
Characterizing Reference Locality in the WWW
As the World Wide Web (Web) is increasingly adopted as the infrastructure for large-scale distributed information systems, issues of performance modeling become ever more critical. In particular, locality of reference is an important property in the performance modeling of distributed information systems. In the case of the Web, understanding the nature of reference locality will help improve t...
GreedyDual* Web caching algorithm: exploiting the two sources of temporal locality in Web request streams
The relative importance of long-term popularity and short-term temporal correlation of references for Web cache replacement policies has not been studied thoroughly. This is partially due to the lack of an accurate characterization of temporal locality that enables the identification of the relative strengths of these two sources of temporal locality in a reference stream. In [21], we have proposed...
Sources and Characteristics of Web Temporal Locality
Temporal locality of reference in Web request streams emerges from two distinct phenomena: the long-term popularity of Web documents and the short-term temporal correlations of references. In this paper we show that the commonly-used distribution of inter-request times is predominantly determined by the power law governing the long-term popularity of documents. This inherent relationship tends...
Modeling strength of locality of reference via notions of positive dependence
The performance of demand-driven caching depends on the locality of reference exhibited by the stream of requests made to the cache. In spite of numerous efforts, no consensus has been reached on how to formally compare streams of requests on the basis of their locality of reference. We take on this issue by introducing the notion of Temporal Correlations (TC) ordering for comparing strength of...